# 128K Long Context
Devstral Small 2505 Fp8
Apache-2.0
Devstral is an agentic large language model for software engineering tasks, developed by Mistral AI in collaboration with All Hands AI. It excels at exploring codebases with tools, editing multiple files, and driving software engineering agents.
Large Language Model
Safetensors Supports Multiple Languages
bullerwins
243
1
Devstral Small 2505
Apache-2.0
Devstral is an agentic large language model designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels at code exploration, multi-file editing, and driving software engineering agents.
Large Language Model
Safetensors Supports Multiple Languages
unsloth
317
11
Llama 3.1 Nemotron Nano 4B V1.1
Other
Llama-3.1-Nemotron-Nano-4B-v1.1 is a large language model derived from Llama 3.1 8B through compression, optimized for inference efficiency and task execution, suitable for local deployment on a single RTX GPU.
Large Language Model
Transformers English

unsloth
219
4
Devstral Small 2505 Bnb 4bit
Apache-2.0
Devstral is an agentic large language model designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels at codebase exploration, multi-file editing, and driving software engineering agents.
Large Language Model
Safetensors Supports Multiple Languages
unsloth
465
3
Medgemma 27b Text It GGUF
Other
MedGemma is a series of medical AI models built on Gemma 3. The 27B version focuses on medical text comprehension and reasoning tasks.
Large Language Model
Transformers

unsloth
9,953
18
Devstral Small 2505 Gguf
Apache-2.0
Devstral is an agentic large language model designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels at code exploration, editing, and driving software engineering agents.
Large Language Model Supports Multiple Languages
mistralai
8,964
44
Typhoon2.1 Gemma3 4b
Thai large language model (instruction-tuned version) with 4 billion parameters, 128K context length, and function calling capability
Large Language Model
Safetensors
scb10x
2,083
3
Typhoon2.1 Gemma3 12b
Typhoon2.1-Gemma3-12B is a 12-billion-parameter Thai large language model based on the Gemma3 architecture, supporting 128K context length and function calling capabilities.
Large Language Model
scb10x
159.13k
2
Gemma 3 12b It Qat Int4 GGUF
Gemma 3 is Google's lightweight open model series based on Gemini technology. The 12B version employs Quantization-Aware Training (QAT) technology, supports multimodal input, and features a 128K context window.
Image-to-Text
unsloth
1,921
3
Phi 4 Mini Instruct.gguf
MIT
Phi-4-mini-instruct is a lightweight open-source model trained with a focus on high-quality, reasoning-rich data. It supports a context length of 128K tokens.
Large Language Model Other
Mungert
13.08k
25
Gemma 3 27b It Qat GGUF
Gemma 3 is a lightweight open model series built by Google based on Gemini technology, supporting multimodal input and text output, featuring a 128K large context window and support for 140+ languages.
Image-to-Text English
unsloth
2,683
3
Gemma 3 12b It Qat Int4
Gemma 3 is a lightweight open model series from Google, built on the research and technology used to create Gemini models. The 12B version is an instruction-tuned multimodal model supporting text and image inputs to generate text outputs.
Image-to-Text
Transformers

unsloth
78
1
R01 Gemma 3 1b It
Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.
Image-to-Text
Transformers English

EpistemeAI
17
1
Gemma 3 1b It Qat Q4 0 Unquantized
Gemma 3 is a lightweight open-source multimodal model series developed by Google, built on Gemini technology, supporting text and image inputs with text outputs. The 1B version has undergone instruction tuning and quantization-aware training (QAT), making it suitable for deployment in resource-constrained environments.
Image-to-Text
Transformers

google
246
4
Gemma 3 12b It Qat Q4 0 Unquantized
Gemma 3 is Google's lightweight open-source multimodal model series based on Gemini technology, supporting text and image inputs with text outputs. The 12B version undergoes instruction tuning and quantization-aware training (QAT), making it suitable for deployment in resource-limited environments.
Image-to-Text
Transformers

google
1,159
10
Gemma 3 4b It Qat Q4 0 GGUF
Gemma is a family of lightweight, cutting-edge open models introduced by Google, built on the same research and technology as the Gemini models. Supports text and image inputs and generates text outputs.
Image-to-Text
Mungert
713
2
Gemma 3 27b It Qat Autoawq
Gemma 3 is a lightweight, cutting-edge open model series from Google, built on the same technology as Gemini, supporting multimodal input (text/image) and text output. The 27B version significantly reduces memory requirements through quantization-aware training.
Image-to-Text
Safetensors
gaunernst
789
4
Gemma 3 12b It Qat Autoawq
Gemma 3 is Google's lightweight open model series based on Gemini technology, supporting multimodal input and text output.
Image-to-Text
Safetensors
gaunernst
498
3
Gemma 3 27b It Qat Q4 0 Gguf
Gemma 3 is a lightweight open-source multimodal model series by Google, supporting text and image inputs with text generation capabilities. This version is a 27B parameter instruction-tuned model using quantization-aware training, offering lower memory requirements while maintaining near-original quality.
Image-to-Text
vinimuchulski
4,674
6
Gemma 3 12b It Qat Q4 0 Gguf
Gemma 3 is a lightweight open model built by Google based on Gemini technology, supporting text and image inputs to generate text outputs. The 12B version is instruction-tuned and suitable for various generation and comprehension tasks.
Image-to-Text
vinimuchulski
1,860
4
Openhands Lm 32b V0.1 AWQ
MIT
OpenHands LM is a 32B-parameter open-source coding model designed for software development agents. It supports local deployment and excels at software engineering tasks.
Large Language Model
Safetensors English
stelterlab
2,635
8
Gemma 3 4b It Llamafile
Gemma 3 is a lightweight open-source model series launched by Google, built on Gemini technology, supporting multimodal input and text output.
Image-to-Text
Mozilla
751
3
Gemma 3 27b It Int4 Gguf
Gemma 3 is a lightweight, cutting-edge open model family from Google, built on the same research and technology as the Gemini models. It supports text and image input with text output, and is available in both pretrained and instruction-tuned versions.
Image-to-Text
gaunernst
232
3
Gemma 3 12b It Int4 Gguf
Gemma 3 is a lightweight multimodal open model from Google that supports text and image inputs with text outputs, featuring a 128K large context window and support for 140+ languages.
Image-to-Text
gaunernst
107
1
Gemma 3 12b It Int4 Awq
Gemma is Google's lightweight, cutting-edge open-source model family, built on the same research and technology as the Gemini models. Gemma 3 is a multimodal model supporting text and image input with text output.
Image-to-Text
Transformers

gaunernst
4,658
9
Gemma 3 12b Pt Unsloth Bnb 4bit
Gemma 3 is a lightweight, advanced open model series launched by Google, built on the same research technology as Gemini, supporting multimodal input and text output.
Image-to-Text
Transformers English

unsloth
1,286
1
Gemma 3 12b It Gguf
Gemma 3 is a lightweight multimodal open model launched by Google, supporting text and image inputs to generate text outputs. Built on the research and technology behind the Gemini models, it features a 128K context window and supports over 140 languages.
Image-to-Text
Mungert
4,574
11
Gemma 3 12b Pt Qat Q4 0 Gguf
Gemma 3 is a lightweight open-source multimodal model from Google, supporting text and image input with text output, featuring a 128K ultra-long context window and support for 140+ languages.
Image-to-Text
google
475
12
Gemma 3 4b It Gguf
Gemma 3 is a lightweight open-source multimodal model introduced by Google, supporting image and text inputs to generate text outputs.
Image-to-Text
Mungert
4,593
9
Gemma 3 4b Pt Qat Q4 0 Gguf
Gemma 3 is a lightweight open model series launched by Google, built on the same technology as Gemini, supporting multimodal input and text output.
Image-to-Text
google
912
16
Gemma 3 12b It Qat Q4 0 Gguf
Gemma 3 is Google's lightweight cutting-edge open-source multimodal model supporting image-text input and text output, featuring a 128K context window and 140+ language support.
Image-to-Text
google
40.86k
109
Gemma 3 1b Pt Qat Q4 0 Gguf
Gemma is a family of lightweight, cutting-edge open models from Google, built on the same research and technology as the Gemini models. The 1B version is a pretrained base model in GGUF format with Quantization-Aware Training (QAT).
Image-to-Text
google
97
6
Gemma 3 12b It GGUF
Gemma 3 is a lightweight open-source multimodal model series launched by Google, built on the same technology as Gemini, supporting text and image inputs and generating text outputs.
Image-to-Text
ggml-org
8,110
23
Gemma 3 4b It GGUF
Gemma 3 is a lightweight open-source multimodal model from Google, supporting text and image inputs with text outputs, featuring a 128K context window and support for 140+ languages.
Image-to-Text
ggml-org
9,023
25
Gemma 3 12b It
Gemma is a lightweight cutting-edge open-source multimodal model series launched by Google, built on the technology used to create Gemini models, supporting text and image inputs to generate text outputs.
Image-to-Text
Transformers

google
364.65k
340
Gemma 3 27b It
Gemma is a lightweight cutting-edge open model series launched by Google, built on the same technology as Gemini, supporting multimodal input and text output.
Image-to-Text
Transformers

google
371.46k
1,274
Gemma 3 27b Pt
Gemma is a series of lightweight, cutting-edge open models launched by Google, built on the same research and technology used to create Gemini models.
Image-to-Text
Transformers

google
13.27k
92
Phi 4 Mini Instruct Abliterated
MIT
Phi-4-mini-instruct is a lightweight open-source model built on synthetic data and curated public websites, focusing on high-quality data with strong reasoning capabilities. It supports a 128K token context length and is enhanced through supervised fine-tuning and direct preference optimization to ensure precise instruction following and safety.
Large Language Model
Transformers Supports Multiple Languages

lunahr
250
8
Gemma 3 1b Pt
Gemma is a series of lightweight, advanced open models from Google, built using the same research and technology as the Gemini models.
Text-to-Image
Transformers

google
171.13k
108
Gemma 3 4b It
Gemma is a lightweight, advanced open model series launched by Google, built on the same research and technology as Gemini. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.
Image-to-Text
Transformers

google
608.22k
477